Yang Feng

Alibaba Group

GeoERM: Geometry-Aware Multi-Task Representation Learning on Riemannian Manifolds

May 05, 2025, via arXiv

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

May 05, 2025, via arXiv

Persona-judge: Personalized Alignment of Large Language Models via Token-level Self-judgment

Apr 17, 2025, via arXiv

StruPhantom: Evolutionary Injection Attacks on Black-Box Tabular Agents Powered by Large Language Models

Apr 14, 2025, via arXiv

LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented Searchers

Feb 25, 2025, via arXiv

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Jan 07, 2025, via arXiv

Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation

Jan 01, 2025, via arXiv

Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models

Nov 29, 2024, via arXiv

Learning Monotonic Attention in Transducer for Streaming Generation

Nov 26, 2024, via arXiv

Speculative Decoding with CTC-based Draft Model for LLM Inference Acceleration

Nov 25, 2024, via arXiv